SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Mixson, James A 

(ii) TITLE OF INVENTION: CARRIER:NUCLEIC ACIDS COMPLEXES CONTAINING
NUCLEIC ACIDS ENCODING ANTI-ANGIOGENIC PEPTIDES AND THEIR USE IN 
GENE THERAPY 

(iii) NUMBER OF SEQUENCES: 43 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Connolly, Bove, Lodge, & Hutz 

(B) STREET: 1220 Market Street, P.O. Box 2207 

(C) CITY: Wilmington 

(D) STATE: Delaware 

(E) COUNTRY: U.S.A. 

(F) ZIP : 19899 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk (provided in parent application) 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Not yet assigned 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/985,526 

(B) FILING DATE: 5-DEC-1997 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: McMorrow Jr., Robert G 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (302) 658-9141 

(B) TELEFAX: (302) 658-5613 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

Met Thr Glu Glu Asn Lys Glu Leu Ala Asn Glu Leu Arg Arg Pro Pro
1 5 10 15

Leu Cys Tyr His Asn Gly Val Gln Tyr Arg Asn Asn Glu Glu Trp Thr
20 25 30

Val Asp Ser Cys Thr Glu Cys His Cys Gln Asn Ser Val Thr Ile Cys
35 40 45

Lys Lys Val Ser Cys Pro Ile Met Pro Cys Ser Asn Ala Thr Val Pro
50 55 60

Asp Gly Glu Cys Cys Pro Arg Cys Trp Pro Ser Asp Ser Ala Asp Asp
65 70 75 80

Gly Trp Ser Pro Trp Ser Glu Trp Thr Ser Cys Ser Thr Ser Cys Gly
85 90 95

Asn Gly Ile Gln Gln Arg Gly Arg Ser Cys Asp Ser Leu Asn Asn Arg
100 105 110

Cys Glu Gly Ser Ser Val Gln Thr Arg Thr Cys His Ile Gln Glu Cys
115 120 125

Asp Lys Arg Phe Lys Gln Asp Gly Gly Trp Ser His Trp Ser Pro Trp
130 135 140

Ser Ser Cys Ser Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Thr
145 150 155 160

Asn Leu Cys Ser Pro Ser Pro Gln Met Asn Gly Lys Pro Cys Glu Gly
165 170 175

Arg Glu Ala Glu Thr Lys Ala Cys Lys Lys Asp Ala Cys Pro Ile Asn
180 185 190

Gly Gly Trp Gly Pro Trp Ser Pro Trp Asp Ile Cys Ser Val Thr Cys
195 200 205 



Gly Gly Gly Val Gln Lys Arg Ser Arg Leu
210 215 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

ATGACTGAAG AGAACAAAGA GTTGGCCAAT GAGCTGAGGC GGCCTCCCCT 
ATGCTATCAC 60 

AACGGAGTTC AGTACAGAAA TAACGAGGAA TGGACTGTTG ATAGCTGCAC 
TGAGTGTCAC 120 

TGTCAGAACT CAGTTACCAT CTGCAAAAAG GTGTCCTGCC CCATCATGCC 
CTGCTCCAAT 180 

GCCACAGTTC CTGATGGAGA ATGCTGTCCT CGCTGTTGGC CCAGCGACTC 
TGCGGACGAT 240 

GGCTGGTCTC CATGGTCCGA GTGGACCTCC TGTTCTACGA GCTGTGGCAA 
TGGAATTCAG 300 

CAGCGCGGCC GCTCCTGCGA TAGCCTCAAC AACCGATGTG AGGGCTCCTC 
GGTCCAGACA 360 

CGGACCTGCC ACATTCAGGA GTGTGACAAA AGATTTAAAC AGGATGGTGG 
CTGGAGCCAC 420 

TGGTCCCCGT GGTCATCTTG TTCTGTGACA TGTGGTGATG GTGTGATCAC 
AAGGATCCGG 480 
CTCTGCAACT CTCCCAGCCC CCAGATGAAT GGGAAACCCT GTGAAGGCGA
AGCGCGGGAG 540 

ACCAAAGCCT GCAAGAAAGA CGCCTGCCCC ATCAATGGAG GCTGGGGTCC 
TTGGTCACCA 600 

TGGGACATCT GTTCTGTCAC CTGTGGAGGA GGGGTACAGA AACGTAGTCG
TCTCTAA 657

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Thr Glu Glu Asn Lys Glu Leu Ala Asn Glu Leu Arg Arg Pro Pro 
1 5 10 15

Leu Cys Tyr His Asn Gly Val Gln Tyr Arg Asn Asn Glu Glu Trp Thr
20 25 30

Asp Val Ser Cys Thr Glu Cys His Cys Gln Asn Ser Val Thr Ile Cys
35 40 45

Lys Lys Val Ser Cys Pro Ile Met Pro Cys Ser Asn Ala Thr Val Pro
50 55 60

Asp Gly Glu Cys Cys Pro Arg Cys Trp Pro Ser Asp Ser Ala Asp Asp
65 70 75 80

Trp Gly Ser Pro Trp Ser Glu Trp Thr Ser Cys Ser Thr Ser Cys Gly
85 90 95

Gly Asn Ile Gln Gln Arg Gly Arg Ser Cys Asp Ser Leu Asn Asn Arg
100 105 110

Cys Glu Gly Ser Ser Val Gln Thr Arg Thr Cys His Ile Gln Glu Cys
115 120 125

Asp Lys Arg Phe Lys Gln Asp Gly Gly Trp Ser His Trp Ser Pro Trp
130 135 140

Ser Ser Cys Ser Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Thr
145 150 155 160

Leu Cys Asn Ser Pro Ser Pro Gln Met Asn Gly Lys Pro Cys Glu Gly
165 170 175

Glu Ala Arg Glu Thr Lys Ala Cys Lys Lys Asp Ala Cys Pro Ile Asn
180 185 190

Gly Gly Trp Gly Pro Trp Ser Pro Trp Asp Ile Cys Ser Val Thr Cys
195 200 205

Gly Gly Gly Val Gln Lys Arg Ser Arg Leu Cys Val Asp Ser Arg Met
210 215 220 

Thr Glu Glu Asn Lys Glu Leu Ala Asn Glu Leu Arg Arg Pro Pro Leu 
225 230 235 240 

Cys Tyr His Asn Gly Val Gln Tyr Arg Asn Asn Glu Glu Trp Thr Val
245 250 255

Asp Ser Cys Thr Glu Cys His Cys Gln Asn Ser Val Thr Ile Cys Lys
260 265 270

Lys Val Ser Cys Pro Ile Met Pro Cys Ser Asn Ala Thr Val Pro Asp
275 280 285

Gly Glu Cys Cys Pro Arg Cys Trp Pro Ser Asp Ser Ala Asp Asp Gly
290 295 300

Trp Ser Pro Trp Ser Glu Trp Thr Ser Cys Ser Thr Ser Cys Gly Asn
305 310 315 320

Gly Ile Gln Gln Arg Gly Arg Ser Cys Asp Ser Leu Asn Asn Arg Cys
325 330 335

Glu Gly Ser Ser Val Gln Thr Arg Thr Cys His Ile Gln Glu Cys Asp
340 345 350

Lys Arg Phe Lys Gln Asp Gly Gly Trp Ser His Trp Ser Pro Trp Ser
355 360 365

Ser Cys Ser Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Thr Leu
370 375 380

Cys Asn Ser Pro Ser Pro Gln Met Asn Gly Lys Pro Cys Glu Gly Glu
385 390 395 400

Ala Arg Glu Thr Lys Ala Cys Lys Lys Asp Ala Cys Pro Ile Asn Gly
405 410 415

Gly Trp Gly Pro Trp Ser Pro Trp Asp Ile Cys Ser Val Thr Cys Gly
420 425 430

Gly Gly Val Gln Lys Arg Ser Arg Leu
435 440 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

ATGACTGAAG AGAACAAAGA GTTGGCCAAT GAGCTGAGGC GGCCTCCCCT 
ATGCTATCAC 60 

AACGGAGTTC AGTACAGAAA TAACGAGGAA TGGACTGTTG ATAGCTGCAC 
TGAGTGTCAC 120 

TGTCAGAACT CAGTTACCAT CTGCAAAAAG GTGTCCTGCC CCATCATGCC 
CTGCTCCAAT 180 

GCCACAGTTC CTGATGGAGA ATGCTGTCCT CGCTGTTGGC CCAGCGACTC 
TGCGGACGAT 240 

GGCTGGTCTC CATGGTCCGA GTGGACCTCC TGTTCTACGA GCTGTGGCAA 
TGGAATTCAG 300 

CAGCGCGGCC GCTCCTGCGA TAGCCTCAAC AACCGATGTG AGGGCTCCTC
GGTCCAGACA 360

CGGACCTGCC ACATTCAGGA GTGTGACAAA AGATTTAAAC AGGATGGTGG 
CTGGAGCCAC 420 

TGGTCCCCGT GGTCATCTTG TTCTGTGACA TGTGGTGATG GTGTGATCAC 
AAGGATCCGG 480 

CTCTGCAACT CTCCCAGCCC CCAGATGAAT GGGAAACCCT GTGAAGGCGA 
AGCGCGGGAG 540 

ACCAAAGCCT GCAAGAAAGA CGCCTGCCCC ATCAATGGAG GCTGGGGTCC 
TTGGTCACCA 600 

TGGGACATCT GTTCTGTCAC CTGTGGAGGA GGGGTACAGA AACGTAGTCG 
TCTCTGCGTC 660 

GACTCTAGAA TGACTGAAGA GAACAAAGAG TTGGCCAATG AGCTGAGGCG 
GCCTCCCCTA 720 

TGCTATCACA ACGGAGTTCA GTACAGAAAT AACGAGGAAT GGACTGTTGA 
TAGCTGCACT 780 

GAGTGTCACT GTCAGAACTC AGTTACCATC TGCAAAAAGG TGTCCTGCCC 
CATCATGCCC 840 

TGCTCCAATG CCACAGTTCC TGATGGAGAA TGCTGTCCTC GCTGTTGGCC
CAGCGACTCT 900 

GCGGACGATG GCTGGTCTCC ATGGTCCGAG TGGACCTCCT GTTCTACGAG 
CTGTGGCAAT 960 

GGAATTCAGC AGCGCGGCCG CTCCTGCGAT AGCCTCAACA ACCGATGTGA 
GGGCTCCTCG 1020 

GTCCAGACAC GGACCTGCCA CATTCAGGAG TGTGACAAAA GATTTAAACA
GGATGGTGGC 1080 

TGGAGCCACT GGTCCCCGTG GTCATCTTGT TCTGTGACAT GTGGTGATGG 
TGTGATCACA 1140 

AGGATCCGGC TCTGCAACTC TCCCAGCCCC CAGATGAATG GGAAACCCTG 
TGAAGGCGAA 1200 

GCGCGGGAGA CCAAAGCCTG CAAGAAAGAC GCCTGCCCCA TCAATGGAGG
CTGGGGTCCT 1260

TGGTCACCAT GGGACATCTG TTCTGTCACC TGTGGAGGAG GGGTACAGAA 
ACGTAGTCGT 1320 

CTCTAA 1326 
(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Met Tyr Ile Gly Ser Arg
1 5 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 
GTCGACATGT ATATTGGTTC TCGTTAAGTC GAC 
(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Met Tyr Ile Gly Ser Arg Gly Lys Ser Tyr Ile Gly Ser Arg Gly Lys
1 5 10 15

Ser Tyr Ile Gly Ser Arg Gly Lys Ser
20 25 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

GTCGACATGT ATATTGGTTC TCGTGTAAAA GTTATATTGG TTCTCGTGGT 
AAAAGTTATA 60 

TTGGTTCTCG TGGTAAAAGT TAAGTCGACC 90 
(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

Met Leu Tyr Lys Lys Ile Ile Lys Lys Leu Leu Glu Ser
1 5 10 
(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

GTCGACATGC TTTATAAGAA GATCATCAAG AAGCTTCTTG AGAGTTAAGT CGAC 
54 

(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

Met Leu Tyr Lys Lys Ile Ile Lys Lys Leu Leu Glu Ser Gly Lys Ser
1 5 10 15

Leu Tyr Lys Lys Ile Ile Lys Lys Leu Leu Glu Ser Gly Lys Ser Leu
20 25 30

Tyr Lys Lys Ile Ile Lys Lys Leu Leu Glu Ser Gly Lys Ser
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GTCGACATGC TTTATAAGAA GATCATCAAG AAGCTTCTTG AGAGTGGTAA 
AAGTCTTTAT 60 

AAGAAGATCA TCAAGAAGCT TCTTGAGAGT GGTAAAAGTC TTTATAAGAA 
GATCATCAAG 120 

AAGCTTCTTG AGAGTGGTAA AAGTTAAGTC GAC 153

(2) INFORMATION FOR SEQ ID NO:13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Met Phe Cys Tyr Trp Lys Val Cys Trp 
1 5 

(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

GTCGACATGT TCTGTTATTG GAAGGTTTGT TGGTAAGTCG AC

(2) INFORMATION FOR SEQ ID NO:15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Phe Cys Tyr Trp Lys Val Cys Trp Gly Lys Ser Phe Cys Tyr Trp
1 5 10 15 

Lys Val Cys Trp Gly Lys Ser Phe Cys Tyr Trp Lys Val Cys Trp Gly 
20 25 30 

Lys Ser 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GTCGACATGT TCTGTTATTG GAAGGTTTGT TGGGGTAAAA GTTTCTGTTA 
TTGGAAGGTT 60 

TGTTGGGGTA AAAGTTTCTG TTATTGGAAG GTTTGTTGGG GTAAAAGTTA 
AGTCGAC 117 

(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

Met Gly Arg Gly Asp
1 5 

(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GTCGACATGG GTCGTGGTGA TTAAGTCGAC
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Gly Arg Gly Asp Gly Lys Ser Gly Arg Gly Asp Gly Lys Ser Gly 
1 5 10 15

Arg Gly Asp Gly Lys Ser
20 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:

GTCGACATGG GTCGTGGTGA TGGTAAAAGT GGTCGTGGTG ATGGTAAAAG
TGGTCGTGGT 60

GATGGTAAAA GTTAAGTCGA C 81

(2) INFORMATION FOR SEQ ID NO:21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 210 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Met Val Tyr Leu Ser Glu Cys Lys Thr Gly Ile Gly Asn Gly Tyr Arg
1 5 10 15

Gly Thr Met Ser Arg Thr Lys Ser Gly Val Ala Cys Gln Lys Trp Gly
20 25 30

Ala Thr Phe Pro His Val Pro Asn Tyr Ser Pro Ser Thr His Pro Asn
35 40 45

Glu Gly Leu Glu Glu Asn Tyr Cys Arg Asn Pro Asp Asn Asp Glu Gln
50 55 60

Gly Pro Trp Cys Tyr Thr Thr Asp Pro Asp Lys Arg Tyr Asp Tyr Cys
65 70 75 80

Asn Ile Pro Glu Cys Glu Glu Glu Cys Met Tyr Cys Ser Gly Glu Lys
85 90 95

Tyr Glu Gly Lys Ile Ser Lys Thr Met Ser Gly Lys Asp Cys Gln Ala
100 105 110

Trp Asp Ser Gln Ser Pro His Ala His Gly Tyr Ile Pro Ala Lys Phe
115 120 125

Pro Ser Lys Asn Leu Lys Met Asn Tyr Cys His Asn Pro Asp Gly Glu
130 135 140

Pro Arg Pro Trp Cys Phe Thr Thr Asp Pro Thr Lys Arg Trp Glu Tyr
145 150 155 160

Cys Asp Ile Pro Arg Cys Thr Thr Pro Pro Pro Pro Pro Ser Pro Thr
165 170 175

Tyr Gln Cys Leu Lys Gly Arg Gly Glu Asn Tyr Arg Gly Thr Val Ser
180 185 190

Val Thr Val Ser Gly Lys Thr Cys Gln Arg Trp Ser Glu Gln Thr Pro
195 200 205 

His Arg 
210 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 645 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
GTCGACATGG TGTATCTGTC AGAATGTAAG ACCGGCATCG GCAACGGCTA
CAGAGGAACC 60 

ATGTCCAGGA CAAAGAGTGG TGTTGCCTGT CAAAAGTGGG GTGCCACGTT 
CCCCCACGTA 120 

CCCAACTACT CTCCCAGTAC ACATCCCAAT GAGGGACTAG AAGAGAACTA 
CTGTAGGAAC 180 

CCAGACAATG ATGAACAAGG GCCTTGGTGC TACACTACAG ATCCGGACAA 
GAGATATGAC 240 

TACTGCAACA TTCCTGAATG TGAAGAGGAA TGCATGTACT GCAGTGGAGA 
AAAGTATGAG 300 

GGCAAAATCT CCAAGACCAT GTCTGGACTT GACTGCCAGG CCTGGGATTC 
TCAGAGCCCA 360 

CATGCTCATG GATACATCCC TGCCAAATTT CCAAGCAAGA ACCTGAAGAT 
GAATTATTGC 420 

CACAACCCTG ACGGGGAGCC AAGGCCCTGG TGCTTCACAA CAGACCCCAC 
CAAACGCTGG 480 

GAATACTGTG ACATCCCCCG CTGCACAACA CCCCCGCCCC CACCCAGCCC 
AACCTACCAA 540 

TGTCTGAAAG GAAGAGGTGA AAATTACCGA GGGACCGTGT CTGTCACCGT 
GTCTGGGAAA 600 

ACCTGTCAGC GCTGGAGTGA GCAAACCCCT CATAGGTGAG TCGAC 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
Met Val Tyr Leu Ser Glu Cys Lys Thr Gly Ile Gly Asn Gly Tyr Arg
1 5 10 15

Gly Thr Met Ser Arg Thr Lys Ser Gly Val Ala Cys Gln Lys Trp Gly
20 25 30

Ala Thr Phe Pro His Val Pro Asn Tyr Ser Pro Ser Thr His Pro Asn
35 40 45

Glu Gly Leu Glu Glu Asn Tyr Cys Arg Asn Pro Asp Asn Asp Glu Gln
50 55 60

Gly Pro Trp Cys Tyr Thr Thr Asp Pro Asp Lys Arg Tyr Asp Tyr Cys
65 70 75 80

Asn Ile Pro Glu Cys Glu Glu Glu Cys Met Tyr Cys Ser Gly Glu Lys
85 90 95

Tyr Glu Gly Lys Ile Ser Lys Thr Met Ser Gly Lys Asp Cys Gln Ala
100 105 110

Trp Asp Ser Gln Ser Pro His Ala His Gly Tyr Ile Pro Ala Lys Phe
115 120 125

Pro Ser Lys Asn Leu Lys Met Asn Tyr Cys His Asn Pro Asp Gly Glu
130 135 140

Pro Arg Pro Trp Cys Phe Thr Thr Asp Pro Thr Lys Arg Trp Glu Tyr
145 150 155 160

Cys Asp Ile Pro Arg Cys Thr Thr Pro Pro Pro Pro Pro Ser Pro Thr
165 170 175

Tyr Gln Cys Leu Lys Gly Arg Gly Glu Asn Tyr Arg Gly Thr Val Ser
180 185 190

Val Thr Val Ser Gly Lys Thr Cys Gln Arg Trp Ser Glu Gln Thr Pro
195 200 205

His Arg Gly Lys Ser Met Val Tyr Leu Ser Glu Cys Lys Thr Gly Ile
210 215 220 

Gly Asn Gly Tyr Arg Gly Thr Met Ser Arg Thr Lys Ser Gly Val Ala 
225 230 235 240 
Cys Gln Lys Trp Gly Ala Thr Phe Pro His Val Pro Asn Tyr Ser Pro
245 250 255

Ser Thr His Pro Asn Glu Gly Leu Glu Glu Asn Tyr Cys Arg Asn Pro
260 265 270

Asp Asn Asp Glu Gln Gly Pro Trp Cys Tyr Thr Thr Asp Pro Asp Lys
275 280 285

Arg Tyr Asp Tyr Cys Asn Ile Pro Glu Cys Glu Glu Glu Cys Met Tyr
290 295 300

Cys Ser Gly Glu Lys Tyr Glu Gly Lys Ile Ser Lys Thr Met Ser Gly
305 310 315 320

Lys Asp Cys Gln Ala Trp Asp Ser Gln Ser Pro His Ala His Gly Tyr
325 330 335

Ile Pro Ala Lys Phe Pro Ser Lys Asn Leu Lys Met Asn Tyr Cys His
340 345 350

Asn Pro Asp Gly Glu Pro Arg Pro Trp Cys Phe Thr Thr Asp Pro Thr
355 360 365

Lys Arg Trp Glu Tyr Cys Asp Ile Pro Arg Cys Thr Thr Pro Pro Pro
370 375 380

Pro Pro Ser Pro Thr Tyr Gln Cys Leu Lys Gly Arg Gly Glu Asn Tyr
385 390 395 400

Arg Gly Thr Val Ser Val Thr Val Ser Gly Lys Thr Cys Gln Arg Trp
405 410 415

Ser Glu Gln Thr Pro His Arg
420 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:

GTCGACATGG TGTATCTGTC AGAATGTAAG ACCGGCATCG GCAACGGCTA 
CAGAGGAACC 60 

ATGTCCAGGA CAAAGAGTGG TGTTGCCTGT CAAAAGTGGG GTGCCACGTT
CCCCCACGTA 120 

CCCAACTACT CTCCCAGTAC ACATCCCAAT GAGGGACTAG AAGAGAACTA 
CTGTAGGAAC 180 

CCAGACAATG ATGAACAAGG GCCTTGGTGC TACACTACAG ATCCGGACAA 
GAGATATGAC 240 

TACTGCAACA TTCCTGAATG TGAAGAGGAA TGCATGTACT GCAGTGGAGA 
AAAGTATGAG 300 

GGCAAAATCT CCAAGACCAT GTCTGGACTT GACTGCCAGG CCTGGGATTC 
TCAGAGCCCA 360 

CATGCTCATG GATACATCCC TGCCAAATTT CCAAGCAAGA ACCTGAAGAT 
GAATTATTGC 420 

CACAACCCTG ACGGGGAGCC AAGGCCCTGG TGCTTCACAA CAGACCCCAC 
CAAACGCTGG 480 

GAATACTGTG ACATCCCCCG CTGCACAACA CCCCCGCCCC CACCCAGCCC 
AACCTACCAA 540 

TGTCTGAAAG GAAGAGGTGA AAATTACCGA GGGACCGTGT CTGTCACCGT 
GTCTGGGAAA 600 

ACCTGTCAGC GCTGGAGTGA GCAAACCCCT CATAGGGGTA AAAGTATGGT 
GTATCTGTCA 660 

GAATGTAAGA CCGGCATCGG CAACGGCTAC AGAGGAACCA TGTCCAGGAC 
AAAGAGTGGT 720 

GTTGCCTGTC AAAAGTGGGG TGCCACGTTC CCCCACGTAC CCAACTACTC 
TCCCAGTACA 780 

CATCCCAATG AGGGACTAGA AGAGAACTAC TGTAGGAACC CAGACAATGA 
TGAACAAGGG 840 
CCTTGGTGCT ACACTACAGA TCCGGACAAG AGATATGACT ACTGCAACAT
TCCTGAATGT 900 

GAAGAGGAAT GCATGTACTG CAGTGGAGAA AAGTATGAGG GCAAAATCTC 
CAAGACCATG 960 

TCTGGACTTG ACTGCCAGGC CTGGGATTCT CAGAGCCCAC ATGCTCATGG 
ATACATCCCT 1020 

GCCAAATTTC CAAGCAAGAA CCTGAAGATG AATTATTGCC ACAACCCTGA 
CGGGGAGCCA 1080 

AGGCCCTGGT GCTTCACAAC AGACCCCACC AAACGCTGGG AATACTGTGA 
CATCCCCCGC 1140 

TGCACAACAC CCCCGCCCCC ACCCAGCCCA ACCTACCAAT GTCTGAAAGG 
AAGAGGTGAA 1200 

AATTACCGAG GGACCGTGTC TGTCACCGTG TCTGGGAAAA CCTGTCAGCG 
CTGGAGTGAG 1260 

CAAACCCCTC ATAGGTGAGT CGAC 1284

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Met Leu Pro Ile Cys Pro Gly Gly Ala Ala Arg Cys Gln Val Thr Leu
1 5 10 15

Arg Glu Leu Phe Asp Arg Ala Val Val Leu Ser His Tyr Ile His Asn
20 25 30

Leu Ser Ser Glu Met Phe Ser Glu Phe Glu Lys Arg Tyr Thr His Gly
35 40 45

Arg Gly Phe Ile Thr Lys Ala Ile Asn Ser Cys His Thr Ser Ser Leu
50 55 60

Ala Thr Pro Glu Asp Lys Glu Gln Ala Gln Gln Met Asn Gln Lys Asp
65 70 75 80

Phe Leu Ser Leu Ile Val Ser Ile Leu Arg Ser Trp Asn Glu Pro Leu
85 90 95

Tyr His Leu Val Thr Glu Val Arg Gly Met Gln Glu Ala Pro Gln Ala
100 105 110

Ile Leu Ser Lys Ala Val Glu Ile Glu Glu Gln Thr Lys
115 120 125 



(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

GTCGACATGT TGCCCATCTG TCCCGGCGGG GCTGCCCGAT GCCAGGTGAC 
CCTTCGAGAC 60 

CTGTTTGACC GCGCCGTCGT CCTGTCCCAC TACATCCATA ACCTCTCCTC 
AGAAATGTTC 120 

AGCGAATTCG ATAAACGGTA TACCCATGGC CGGGGGTTCA TTACCAAGGC 
CATCAACAGC 180 

TGCCACACTT CTTCCCTTGC CACCCCCGAA GACAAGGAGC AAGCCCAACA 
GATGAATCAA 240 

AAAGACTTTC TGAGCCTGAT AGTCAGCATA TTGCGATCCT GGAATGAGCC
TCTGTATCAT 300 

CTGGTCACGG AAGTACGTGG TATGCAAGAA GCCCCGGAGG CTATCCTATC 
CAAAGCTGTA 360

GAGATTGAGG AGCAAACCAA ATAAGTCGAC 
(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:

Met Leu Pro Ile Cys Pro Gly Gly Ala Ala Arg Cys Gln Val Thr Leu
1 5 10 15

Arg Glu Leu Phe Asp Arg Ala Val Val Leu Ser His Tyr Ile His Asn
20 25 30

Leu Ser Ser Glu Met Phe Ser Glu Phe Glu Lys Arg Tyr Thr His Gly
35 40 45

Arg Gly Phe Ile Thr Lys Ala Ile Asn Ser Cys His Thr Ser Ser Leu
50 55 60

Ala Thr Pro Glu Asp Lys Glu Gln Ala Gln Gln Met Asn Gln Lys Asp
65 70 75 80

Phe Leu Ser Leu Ile Val Ser Ile Leu Arg Ser Trp Asn Glu Pro Leu
85 90 95

Tyr His Leu Val Thr Glu Val Arg Gly Met Gln Glu Ala Pro Gln Ala
100 105 110

Ile Leu Ser Lys Ala Val Glu Ile Glu Glu Gln Thr Lys Gly Lys Ser
115 120 125

Met Leu Pro Ile Cys Pro Gly Gly Ala Ala Arg Cys Gln Val Thr Leu
130 135 140

Arg Glu Leu Phe Asp Arg Ala Val Val Leu Ser His Tyr Ile His Asn
145 150 155 160

Leu Ser Ser Glu Met Phe Ser Glu Phe Glu Lys Arg Tyr Thr His Gly
165 170 175

Arg Gly Phe Ile Thr Lys Ala Ile Asn Ser Cys His Thr Ser Ser Leu
180 185 190

Ala Thr Pro Glu Asp Lys Glu Gln Ala Gln Gln Met Asn Gln Lys Asp
195 200 205

Phe Leu Ser Leu Ile Val Ser Ile Leu Arg Ser Trp Asn Glu Pro Leu
210 215 220

Tyr His Leu Val Thr Glu Val Arg Gly Met Gln Glu Ala Pro Gln Ala
225 230 235 240

Ile Leu Ser Lys Ala Val Glu Ile Glu Glu Gln Thr Lys
245 250 

(2) INFORMATION FOR SEQ ID NO:28:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 771 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

GTCGACATGT TGCCCATCTG TCCCGGCGGG GCTGCCCGAT GCCAGGTGAC 
CCTTCGAGAC 60 

CTGTTTGACC GCGCCGTCGT CCTGTCCCAC TACATCCATA ACCTCTCCTC
AGAAATGTTC 120 

AGCGAATTCG ATAAACGGTA TACCCATGGC CGGGGGTTCA TTACCAAGGC 
CATCAACAGC 180 

TGCCACACTT CTTCCCTTGC CACCCCCGAA GACAAGGAGC AAGCCCAACA 
GATGAATCAA 240 
AAAGACTTTC TGAGCCTGAT AGTCAGCATA TTGCGATCCT GGAATGAGCC
TCTGTATCAT 300 

CTGGTCACGG AAGTACGTGG TATGCAAGAA GCCCCGGAGG CTATCCTATC 
CAAAGCTGTA 360 

GAGATTGAGG AGCAAACCGG TAAAAGTATG TTGCCCATCT GTCCCGGCGG 
GGCTGCCCGA 420 

TGCCAGGTGA CCCTTCGAGA CCTGTTTGAC CGCGCCGTCG TCCTGTCCCA 
CTACATCCAT 480 

AACCTCTCCT CAGAAATGTT CAGCGAATTC GATAAACGGT ATACCCATGG 
CCGGGGGTTC 540 

ATTACCAAGG CCATCAACAG CTGCCACACT TCTTCCCTTG CCACCCCCGA 
AGACAAGGAG 600 

CAAGCCCAAC AGATGAATCA AAAAGACTTT CTGAGCCTGA TAGTCAGCAT 
ATTGCGATCC 660 

TGGAATGAGC CTCTGTATCA TCTGGTCACG GAAGTACGTG GTATGCAAGA 
AGCCCCGGAG 720 

GCTATCCTAT CCAAAGCTGT AGAGATTGAG GAGCAAACCA AATAAGTCGA C 
771 

(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

ATGCTGAGGC GGCCTCCCCT ATGCTATCAC AACGGAGTTC AGTACAGAAA 
TAACGGTAAA 60 

AGATCCCCGT GGTCATCTTG TTCTGTGACA TGTGGTGATG GTGTGATGGT 
AAAAGAAGTG 120

GTACCCTGTA GACAAGACAG TGGACACCTC CTCCCCATTA A 
(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:

Met Leu Arg Arg Pro Pro Leu Cys Tyr His Asn Gly Val Gln Tyr Arg
1 5 10 15

Asn Asn Glu Glu Trp Thr Val Asp Ser Gly Lys Ser Ser Pro Trp Ser
20 25 30

Ser Cys Ser Val Thr Cys Gly Asp Gly Val Ile Thr Arg Ile Gly Lys
35 40 45

Ser Ser Pro Trp Asp Ile Cys Ser Val Thr Cys Gly Gly Gly Val
50 55 60

(2) INFORMATION FOR SEQ ID NO:31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

ATGCTGAGGC GGCCTCCCCT ATGCTATCAC AACGGAGTTC AGTACAGAAA
TAACGGTAAA 60 
AGATCCCCGT GGTCATCTTG TTCTGTGACA TGTGGTGATG GTGTGATGGT
AAAAGAAGTG 120 

GTACCCTGTA GACAAGACAG TGGACACCTC CTCCCCATTA TATTGGTTCT 
CGTGGTAAAA 180 

GATAA 185 
(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:

TAGGTCTAGA ATGACTGAAG AGAACAAAGA G 31

(2) INFORMATION FOR SEQ ID NO:33:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
ATGGTCTAGA TTAGAGACGA CTACGTTTCT G 
(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Met Glu Ser Lys Ala Leu Leu Ala Val Ala Leu Trp Phe Cys Val Glu 
1 5 10 15

Thr Arg Ala Ala Ser Val Gly Leu Pro Gly Asp Phe Leu His Pro Pro
20 25 30

Lys Leu Ser Thr Gln Lys Asp Ile Leu Thr Ile Leu Ala Asn Thr Thr
35 40 45

Leu Gln Ile Thr Cys Arg Gly Gln Arg Asp Leu Asp Trp Leu Trp Pro
50 55 60

Asn Ala Gln Arg Asp Ser Glu Glu Arg Val Leu Val Thr Glu Cys Gly
65 70 75 80

Gly Gly Asp Ser Ile Phe Cys Lys Thr Leu Thr Ile Pro Arg Val Val
85 90 95

Gly Asn Asp Thr Gly Ala Tyr Lys Cys Ser Tyr Arg Asp Val Asp Ile
100 105 110

Ala Ser Thr Val Tyr Val Tyr Val Arg Asp Tyr Arg Ser Pro Phe Ile
115 120 125

Ala Ser Val Ser Asp Gln His Gly Ile Val Tyr Ile Thr Glu Asn Lys
130 135 140

Asn Lys Thr Val Val Ile Pro Cys Arg Gly Ser Ile Ser Asn Leu Asn
145 150 155 160

Val Ser Leu Cys Ala Arg Tyr Pro Glu Lys Arg Phe Val Pro Asp Gly
165 170 175

Asn Arg Ile Ser Trp Asp Ser Glu Ile Gly Phe Thr Leu Pro Ser Tyr
180 185 190

Met Ile Ser Tyr Ala Gly Met Val Phe Cys Glu Ala Lys Ile Asn Asp
195 200 205

Glu Thr Tyr Gln Ser Ile Met Tyr Ile Val Val Val Val Gly Tyr Arg
210 215 220

Ile Tyr Asp Val Ile Leu Ser Pro Pro His Glu Ile Glu Leu Ser Ala
225 230 235 240

Gly Glu Lys Leu Val Leu Asn Cys Thr Ala Arg Thr Glu Leu Asn Val
245 250 255

Gly Leu Asp Phe Thr Trp His Ser Pro Pro Ser Lys Ser His His Lys
260 265 270

Lys Ile Val Asn Arg Asp Val Lys Pro Phe Pro Gly Thr Val Ala Lys
275 280 285

Met Phe Lys Ser Thr Leu Thr Ile Glu Ser Val Thr Lys Ser Asp Gln
290 295 300

Gly Glu Tyr Thr Cys Val Ala Ser Ser Gly Arg Met Ile Lys Arg Asn
305 310 315 320

Arg Thr Phe Val Arg Val His Thr Lys Pro Phe Ile Ala Phe Gly Ser
325 330 335

Gly Met Lys Ser Leu Val Glu Ala Thr Val Gly Ser Gln Val Arg Ile
340 345 350

Pro Val Lys Tyr Leu Ser Tyr Pro Ala Pro Asp Ile Lys Trp Tyr Arg
355 360 365

Asn Gly Arg Pro Ile Glu Ser Asn Tyr Thr Met Ile Val Gly Asp Glu
370 375 380

Leu Thr Ile Met Glu Val Thr Glu Arg Asp Ala Gly Asn Tyr Thr Val
385 390 395 400

Ile Leu Thr Asn Pro Ile Ser Met Glu Lys Gln Ser His Met Val Ser
405 410 415

Leu Val Val Asn Val Pro Pro Gln Ile Gly Glu Lys Ala Leu Ile Ser
420 425 430 
Pro Met Asp Ser Tyr Gly Tyr Gly Thr Met Gln Thr Leu Thr Cys Thr
435 440 445

Val Tyr Ala Asn Pro Pro Leu His His Ile Gln Trp Tyr Trp Gln Leu
450 455 460

Glu Glu Ala Cys Ser Tyr Arg Pro Gly Gln Thr Ser Pro Tyr Ala Cys
465 470 475 480

Lys Glu Trp Arg His Val Glu Asp Phe Gln Gly Gly Asn Lys Ile Glu
485 490 495

Val Thr Lys Asn Gln Tyr Ala Leu Ile Glu Gly Lys Asn Lys Thr Val
500 505 510

Ser Thr Leu Val Ile Gln Ala Ala Asn Val Ser Ala Leu Tyr Lys Cys
515 520 525

Glu Ala Ile Asn Lys Ala Gly Arg Gly Glu Arg Val Ile Ser Phe His
530 535 540

Val Ile Arg Gly Pro Glu Ile Thr Val Gln Pro Ala Ala Gln Pro Thr
545 550 555 560

Glu Gln Glu Ser Val Ser Leu Leu Cys Thr Ala Asp Arg Asn Thr Phe
565 570 575

Glu Asn Leu Thr Trp Tyr Lys Leu Gly Ser Gln Ala Thr Ser Val His
580 585 590

Met Gly Glu Ser Leu Thr Pro Val Cys Lys Asn Leu Asp Ala Leu Trp
595 600 605

Lys Leu Asn Gly Thr Met Phe Ser Asn Ser Thr Asn Asp Ile Leu Ile
610 615 620

Val Ala Phe Gln Asn Ala Ser Leu Gln Asp Gln Gly Asp Tyr Val Cys
625 630 635 640

Ser Ala Gln Asp Lys Lys Thr Lys Lys Arg His Cys Leu Val Lys Gln
645 650 655

Leu Ile Ile Leu Glu Arg Met Ala Pro Met Ile Thr Gly Asn Leu Glu
660 665 670 
Asn Gln Thr Thr Thr Ile Gly Glu Thr Ile Glu Val Thr Cys Pro Ala
675 680 685

Ser Gly Asn Pro Thr Pro His Ile Thr Trp Phe Lys Asp Asn Glu Thr
690 695 700

Leu Val Glu Asp Ser Gly Ile Val Leu Arg Asp Gly Asn Arg Asn Leu
705 710 715 720

Thr Ile Arg Arg Val Arg Lys Glu Asp Gly Gly Leu Tyr Thr Cys Gln
725 730 735

Ala Cys Asn Val Leu Gly Cys Ala Arg Ala Glu Thr Leu Phe Ile Ile
740 745 750

Glu Gly Ala Gln Glu Lys Thr Asn Leu Glu Val Ile Ile Leu Val Gly
755 760 765

Thr Ala Val Ile Ala Met Phe Phe Trp Leu Leu Leu Val Ile Leu Val
770 775 780

Arg Thr Val Lys Arg Ala Asn Glu Gly Glu Leu Lys Thr Gly Tyr Leu
785 790 795 800

Ser Ile Val Met Asp
805 

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

AGACGTCATG GAGAGCAAGG CGCTGCTAGC TGTCGCTCTG TGGTTCTGCG 
TGGAGACCCG 60 
AGCCGCCTCT GTGGGTTTGC CTGGCGATTT TCTCCATCCC CCCAAGCTCA
GCACACAGAA 120 

AGACATACTG ACAATTTTGG CAAATACAAC CCTTCAGATT ACTTGCAGGG 
GACAGCGGGA 180 

CCTGGACTGG CTTTGGCCCA ATGCTCAGCG TGATTCTGAG GAAAGGGTAT 
TGGTGACTGA 240 

ATGCGGCGGT GGTGACAGTA TCTTCTGCAA AACACTCACC ATTCCCAGGG 
TGGTTGGAAA 300 

TGATACTGGA GCCTACAAGT GCTCGTACCG GGACGTCGAC ATAGCCTCCA 
CTGTTTATGT 360 

CTATGTTCGA GATTACAGAT CACCATTCAT CGCCTCTGTC AGTGACCAGC 
ATGGCATCGT 420 

GTACATCACC GAGAACAAGA ACAAAACTGT GGTGATCCCC TGCCGAGGGT 
CGATTTCAAA 480 

CCTCAATGTG TCTCTTTGCG CTAGGTATCC AGAAAAGAGA TTTGTTCCGG 
ATGGAAACAG 540

AATTTCCTGG GACAGCGAGA TAGGCTTTAC TCTCCCCAGT TACATGATCA 
GCTATGCCGG 600 

CATGGTCTTC TGTGAGGCAA AGATCAATGA TGAAACCTAT CAGTCTATCA 
TGTACATAGT 660 

TGTGGTTGTA GGATATAGGA TTTATGATGT GATTCTGAGC CCCCCGCATG 
AAATTGAGCT 720 

ATCTGCCGGA GAAAAACTTG TCTTAAATTG TACAGCGAGA ACAGAGCTCA
ATGTGGGGCT 780 

TGATTTCACC TGGCACTCTC CACCTTCAAA GTCTCATCAT AAGAAGATTG 
TAAACCGGGA 840 

TGTGAAACCC TTTCCTGGGA CTGTGGCGAA GATGTTTTTG AGCACCTTGA 
CAATAGAAAG 900 

TGTGACCAAG AGTGACCAAG GGGAATACAC CTGTGTAGCG TCCAGTGGAC 
GGATGATCAA 960 
GAGAAATAGA ACATTTGTCC GAGTTCACAC AAAGCCTTTT ATTGCTTTCG
GTAGTGGGAT 1020 

GAAATCTTTG GTGGAAGCCA CAGTGGGCAG TCAAGTCCGA ATCCCTGTGA 
AGTATCTCAG 1080 

TTACCCAGCT CCTGATATCA AATGGTACAG AAATGGAAGG CCCATTGAGT 
CCAACTACAC 1140 

AATGATTGTT GGCGATGAAC TCACCATCAT GGAAGTGACT GAAAGAGATG 
CAGGAAACTA 1200 

CACGGTCATC CTCACCAACC CCATTTCAAT GGAGAAACAG AGCCACATGG 
TCTCTCTGGT 1260 

TGTGAATGTC CCACCCCAGA TCGGTGAGAA AGCCTTGATC TCGCCTATGG 
ATTCCTACCA 1320 

GTATGGGACC ATGCAGACAT TGACATGCAC AGTCTACGCC AACCCTCCCC 
TGCACCACAT 1380 

CCAGTGGTAC TGGCAGCTAG AAGAAGCCTG CTCCTACAGA CCCGGCCAAA 
CAAGCCCGTA 1440 

TGCTTGTAAA GAATGGAGAC ACGTGGAGGA TTTCCAGGGG GGAAACAAGA 
TCGAAGTCAC 1500 

CAAAAACCAA TATGCCCTGA TTGAAGGAAA AAACAAAACT GTAAGTACGC 
TGGTCATCCA 1560 

AGCTGCCAAC GTGTCAGCGT TGTACAAATG TGAAGCCATC AACAAAGCGG 
GACGAGGAGA 1620 

GAGGGTCATC TCCTTCCATG TGATCAGGGG TCCTGAAATT ACTGTGCAAC 
CTGCTGCCCA 1680 

GCCAACTGAG CAGGAGAGTG TGTCCCTGTT GTGCACTGCA GACAGAAATA 
CGTTTGAGAA 1740 

CCTCACGTGG TACAAGCTTG GCTCACAGGC AACATCGGTC CACATGGGCG 
AATCACTCAC 1800 

ACCAGTTTGC AAGAACTTGG ATGCTCTTTG GAAACTGAAT GGCACCATGT 
TTTCTAACAG 1860 






CACAAATGAC ATCTTGATTG TGGCATTTCA GAATGGCTCT CTGCAGGACC 
AAGGCGACTA 1920 

TGTTTGCTCT GCTCAAGATA AGAAGACCAA GAAAAGACAT TGCCTGGTCA 
AACAGCTCAT 1980 

CATCCTAGAG CGCATGGCAC CCATGATCAC CGGAAATCTG GAGAATCAGA 
CAACAACCAT 2040 

TGGCGAGACC ATTGAAGTGA CTTGCCCAGC ATCTGGAAAT CCTACCCCAG 
ACATTACATG 2100 

GTTCAAAGAC AACGAGACCC TGGTAGAAGA TTCAGGCATT GTACTGAGAG 
ATGGGAACCG 2160 

GAACCTGACT ATCCGCAGGG TGAGGAAGGA GGATGGAGGC CTCTACACCT 
GCCAGGCCTG 2220 

CAATGTCCTT GGCTGTGCAA GAGCGGAGAC GCTCTTCATA ATAGAAGGTG 
CCCAGGAAAA 2280 

GACCAACTTG GAAGTCATTA TCCTCGTCGG CACTGCAGTG ATTGCCATGT 
TCTTCTGGCT 2340 

CCTTCTTGTC ATTCTCGTAC GGACCGTTAA GCGGGCCAAT GAAGGGGAAC 
TGAAGACAGG 2400 

CTACTTGTCT ATTGTCATGG ATTAAGACGT C 2431 

(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Met His Thr His Gln Asp Phe Gln Pro Val Leu His Leu Val Ala Leu 
1 5 10 15 

Asn Thr Pro Leu Ser Gly Gly Met Arg Gly Ile Arg Gly Ala Asp Phe 
20 25 30 

Gln Cys Phe Asn Asn Ala Arg Val Gly Leu Ser Gly Thr Phe Arg Ala 
35 40 45 

Phe Leu Ser Ser Arg Leu Gln Asp Leu Tyr Ser Ile Val Arg Arg Ala 
50 55 60 

Asp Arg Gly Ser Val Pro Ile Val Gln Asn Leu Arg Asp Glu Val Leu 
65 70 75 80 

Ser Pro Ser Trp Asp Ser Leu Phe Ser Gly Ser Gln Gly Gln Leu Gln 
85 90 95 

Pro Gly Ala Arg Ile Phe Ser Phe Asp Gly Arg Asp Val Leu Arg His 
100 105 110 

Pro Ala Trp Pro Gln Arg Ser Val Trp His Gly Ser Asp Pro Ser Gly 
115 120 125 

Arg Arg Leu Met Glu Ser Tyr Cys Glu Thr Trp Arg Thr Glu Thr Thr 
130 135 140 

Gly Ala Thr Gly Gln Ala Ser Ser Leu Leu Ser Gly Arg Leu Leu Glu 
145 150 155 160 

Gln Arg Ala Ala Ser Cys His Asp Ser Tyr Ile Val Leu Cys Ile Glu 
165 170 175 

Asn Ser Phe Met Thr Ser Phe Ser Arg 
180 185 

(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

AGACGTCATG CATACTCATC AGGACTTTCA GCCAGTGCTC CACCTGGTGG 
CACTGAACAC 60 

CCCCCTGTCT GGAGGCATGC GTGGTATCCG TGGAGCAGAT TTCCAGTGCT 
TCCAGCAAGC 120 

CCGAGCCGTG GGGCTGTCGG GCACCTTCCG GGCTTTCCTG TCCTCTAGGC 
TGCAGGATCT 180 

CTATAGCATC GTGCGCCGTG CTGACCGGGG GTCTGTGCCC ATCGTCAACC 
TGAAGGACGA 240 

GGTGCTATCT CCCAGCTGGG ACTCCCTGTT TTCTGGCTCC CAGGGTCAAC 
TGCAACCCGG 300 

GGCCCGCATC TTTTCTTTTG ACGGCAGAGA TGTCCTGAGA CACCCAGCCT 
GGCCGCAGAA 360 

1 OACCCTATCCCACC^CT^^ 
ACTGTGAGAC 420 

ATGGCGAACT GAAACTACTG GGGCTACAGG TCAGGCCTCC TCCCTGCTGT 
CAGGCAGGCT 480 

CCTGGAACAG AAAGCTGCGA GCTGCCACAA CAGCTACATC GTCCTGTGCA 
TTGAGAATAG 540 

CTTCATGACC TCTTTCTCCA AATAG 565 

(2) INFORMATION FOR SEQ ID NO:38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

CTATCGTCGA CATGTATATT GGTTCTCGTT AAGTCGACCT ATC 
(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
GATAGGTCGA CTTAACGAGA ACCAATATAC ATGTCGACGA TAG 
(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
AGTATCTAGA ATGAGTGTAT CTGTCACAAT G 31 
(2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
GAATTCTAGA TCACCTATGA GGGGTTTGCT C 31 



(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

CTATCGTCGA CATGTATATT GGTTCTCGTA AAAGATATAT TGGTTCTCGT 
GGTAAAAGAG 60 

ATGGTTCTCG TGGTAAAAGA TAAGTGACCT ATC 93 
(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
GATAGGTCGA CTTAT 15 






